Change Streaming Imaging (Processing) Job Options in QC
A Streaming Discovery job combines a Discovery Job and either a Data Extract Job or a Processing Job. This topic explains how to change the Imaging (Processing) Job options in the QC module.
In the eCapture Controller you define the Streaming Discovery Job options at the Case (Project) level, or for individual Streaming Discovery Jobs. On the Streaming Discovery Job options dialog:
-
On the Imaging tab of the Streaming Discovery Job options dialog, if the Enable Imaging check box is unchecked, the job associated with the Streaming Discovery Job is a Data Extract job.
-
On the Imaging tab of the Streaming Discovery Job options dialog, if the Enable Imaging check box is checked, the job associated with the Streaming Discovery job is a Processing Job. A Streaming Imaging job is, essentially, the Processing Job associated with a Streaming Discovery Job.
A Streaming Imaging job may be opened in QC after it is completed and loaded.
|
Note: The Discovery Job associated with a Streaming Discovery Job can also be opened in QC after it is completed and loaded. |
- If the documents were autoloaded (after reprocessing) to the review platform (IPRO Eclipse or Relativity), a new load file is created containing new output. The new load file is sent to the review platform.
-
If a document was not exported after reprocessing, the images are replaced. However, a load file is not created because the new output will be used when the document is eventually exported. If a load file is created and sent to the review platform, an Autoloads dialog appears containing information about the autoload and its status. An example is shown in the following figure.
The autoloads status is checked and updated every 5 seconds. The Autoloads dialog cannot be closed and will contain all autoloads initiated by the specific particular QC instance (determined by the time the QC executable is running). Autoloads are not persisted between QC instances. They are started as a part of the reprocess operation. QC can be safely closed as long as the import has started.
For more information about autoloading into IPRO Eclipse, see Autoload into IPRO Eclipse - Data Extract Job and Autoload into IPRO Eclipse - Processing Job.
-
At the bottom of the Document List window, click in the QC Functions toolbar to open the Options for Streaming Imaging job. The Processing Job Options dialog box appears.
-
Click on the General, Excel, Word, and PowerPoint Options tabs to set the Processing Job options for reprocessing the Streaming Imaging job. Review the Streaming Discovery: Imaging Options link below for more information on Processing Job settings. When finished, click OK to exit the dialog.
Note: After you change the Processing Job options, when you reprocess selected files in the Documents List window in the QC module, the files will be reprocessed using the modified Imaging (Processing) Job options.
Click on Streaming Discovery: Imaging Options below for more information on the Streaming Imaging (Processing) Job Options. To learn more about the QC Flags applicable to Streaming Discovery, click on QC Flags for Streaming Discovery.
Streaming Discovery: Imaging Options
Streaming Imaging options are defined on five different tabs, General, Excel, Word, PowerPoint, and Placeholder. See the following sections for more information.
|
Important: To define imaging options for the Streaming Discovery Job, you must first select the check box Enable Imaging located on the General tab. Once selected, the imaging options display on all five tabs. |
Streaming Discovery Imaging: General Options
- Click the General Options tab.
-
Set the OCR options.
Note: If you are setting Case (Project) Level options, OCR and Time Zone Handling options are defined on the Common Options tab because Processing Jobs and Data Extract Jobs use the same OCR and Time Zone Handling options. For more information about setting options at the Case (Project) level, see Create a New Case (Project).
- OCR pages missing text - Select OCR Pages missing text to OCR pages within documents that are missing text. Optionally, select PDF page character threshold and indicate a value. The default value is 25 characters. The maximum value is 10000. If the value is less than 25, eCapture will send the page to be OCRed. If necessary, enter a different value.
- PDF page character threshold - Select a PDF page character threshold and indicate a value. The default value is 25 characters. If the value is less than 25, eCapture will send the page to be OCRed. If necessary, enter a different value.
- Minimum average OCR confidence - The level range settings are from 1 up to 100. The default is 50. The OCR Confidence Level is the average of confidence per document, for all pages within a document on which OCR was performed. Success or failure of a document for flagging is based on the average confidence level of the document. If the average confidence level is below the selected threshold, the document will be flagged in QC with the OCR Low Confidence Flag.
-
OCR languages - eCapture includes multi-language OCR capability. The QC document will contain the original OCR languages that were selected for the Data Extract job. A valid multi-language OCR license must be available in order to modify the original selected languages, if necessary.
To reserve a portion of the multi-language OCR licenses for QC and to keep the Worker from consuming all available licenses, use the Multi-Language OCR License slider located in the Controller System Options dialog box.
Click OCR Languages to display the Language OCR dialog box.
After selecting the languages, click OK to close the dialog box. The selected languages appear in the OCR Languages field. Place the mouse pointer on the OCR Languages field to display a tool tip that lists all the selected languages that were not visible in the OCR Languages field. The OCR Languages field is a read-only field.
Click here to view some caveats to OCR Language handling.
English is the only language that is selected by default. The more languages that are selected, the lower the confidence level will be for correctly identifying the languages in a document.
-
If English is selected, Arabic will not be available for selection.
-
If Arabic is selected, all other languages will not be available for selection.
-
If one of the CJK (Chinese, Japanese, Korean) languages are selected, then all remaining CJK languages will not be available for selection. Other languages (excluding Arabic) may be selected.
-
If Chinese Simplified is selected, Chinese Traditional, Japanese, and Korean will not be available for selection.
-
If Chinese Traditional is selected, Chinese Simplified, Japanese, and Korean will not be available for selection.
-
If Japanese is selected, Chinese Simplified, Chinese Traditional, and Korean will not be available for selection.
-
If Korean is selected, Chinese Simplified, Chinese Traditional, and Japanese will not be available for selection.
-
- Set the Color Depth, Paper Size, and other basic options.
General Color Depth - Applies to everything else outside of the five types (Word, Excel, PowerPoint, PDf, and Native TIFF) that eCapture does not process through Oracle (formerly Stellent). There are three exceptions to this rule: Lotus Notes, Internet Explorer, and Outlook Express; which also fall under the General type. All other email, except for Lotus Notes and Outlook Express at this time, are always Group 4 TIFF because it is rendered from text.
Single Page Output Type
General Color Depth Options
Rendered as
Black&White (1-bit)
Group 4 TIFF
Grayscale (8-bit)
LZW TIFF
256 Color (8-bit)
LZW TIFF
True Color (24-bit)
JPEG
Multi-Page TIFF Output Type
General Color Depth Options
Rendered as
Black&White (1-bit)
Group 4 TIFF
Grayscale (8-bit)
LZW TIFF
256 Color (8-bit)
LZW TIFF
True Color (24-bit)
JTIFF - (JPEG compressed TIFF)
Image Color Depth - Applies to: BMP, TIFF, PCX, GIF, WPG, WINDOWSICON, WINDOWSCURSOR, MACPAINT, CGM, DCX, SUNRASTER, KODAKPCD, PNG, DGN, PBM, and ADOBE PHOTOSHOP. However, if Lead fails to open a file, it then goes to Oracle (formerly Stellent) and uses the General Color Depth options.
Image Color Depth Options
Rendered as
As Is
If Original is Black&White, then Group 4 TIFF; otherwise, it will be a JPG matching bit depth.
Black&White (1-bit)
Group 4 TIFF
Grayscale (8-bit)
LZW TIFF
True Color (24-bit)
JPG
PDF Color Depth - Select a PDF Color Depth. A PDF always uses the selected color depth setting in the PDF area. There are two possible outcomes:
Successful Use of the Adobe Library
PDF Color Depth Options
Rendered as
As Is
If Original is Black&White, then Group 4 TIFF; otherwise, it will be a JPG matching bit depth.
Black&White (1-bit)
Group 4 TIFF
Grayscale (8-bit)
JPG (8-bit)
True Color (24-bit)
JPG
Unsuccessful Extraction of the Adobe Library
PDF Color Depth Options
Rendered as
As Is
Always 24-bit JPG
Black&White (1-bit)
Group 4 TIFF
Grayscale (8-bit)
JPG (8-bit)
True Color (24-bit)
JPG
- PDF Paper Size - Select an output paper size for PDFs. When the As Is option is selected, the internal PDF document size is used to draw the image.
Paper Size - Click the drop-down menu and select an output paper size for documents during processing.
- Image to PDF - When this option is selected, the system creates a PDF of the reprocessed document and places it in the Output directory with a .PDF extension.
Max Page Threshold - Set a Max Page Threshold (1 to 10000) if you want to limit the number of pages produced by larger files. By default, this option is not selected. If the Page Threshold is reached, the items are not flagged as exceptions, but flagged as Page Threshold Exceeded. All pages processed up until the threshold is reached are included in the document. The first page is the Page Threshold Exceeded placeholder, and subsequent pages are those that were processed within the Max Page Threshold setting.
Placeholder pages over threshold - Select this option to apply a placeholder to pages exceeding the threshold value indicated in the Max Page Threshold field.
Text handling - On the drop-down menu, choose either:
Truncate text to max pages - text is truncated to match the output of pages that fall under the threshold (existing behavior).
Retain all text for document - document text is associated to the number of pages below the set threshold value and all subsequent pages are blank.
-
Set the Time Zone Handling, as appropriate.
- Convert all times to UTC
- Specify Time Zone
For more information about Time Zone Handling, see How eCapture Handles Dates and Time Zones.
Note: If you are setting Case (Project) Level options, OCR and Time Zone Handling options are defined on the Common Options tab because Processing Jobs and Data Extract Jobs use the same OCR and Time Zone Handling options. For more information about setting options at the Case (Project) level, see Create a New Case (Project).
-
Click Advanced Options to set more complex General Options rules. The Advanced Imaging dialog box appears.
- Remove blank pages - Select this option and then set the Blank page threshold (1 to 2000) to a value that eliminates the speckles without eliminating any punctuation marks from the pages. eCapture removes any images that have fewer "dots" than this threshold. If this setting is too high, you may lose images with a few short words. We suggest a setting of 50 as a starting point.
- Process CSV files with Microsoft Excel - Select this option to process CSV files by using Microsoft Excel instead of Oracle (formerly Stellent).
- Process HTML files with Internet Explorer - Select this option to process HTML files by using Internet Explorer instead of Oracle (formerly Stellent).
-
Enable internet links in emails - This option controls whether inline images are downloaded from the internet. Clearing this option can improve performance on environments that do not have internet access.
-
Set Lotus Notes options, as appropriate:
- High Speed (Optimized for speed)
- Medium Speed (Balance of speed and quality)
- Low Speed (Optimized for highest quality output)
-
Click the Outlook/EML link, Select Handling/Order. The Outlook/EML Text Cutoff Handling dialog box appears. Select an option and click either the or to move it to a specific order location. Repeat for additional options. Options include:
-
Attempt in Landscape w Shrink to Fit
-
Attempt in Portrait w Shrink to Fit
-
Attempt in RTF
-
Attempt in Text
-
Assign Text Cutoff Flag and Manage in QC - This is the default setting. It cannot be repositioned.
-
-
Click the Lotus Notes link, Select Handling/Order. The Lotus Notes Text Cutoff Handling dialog box appears. Select an option and click either the or to move it to a specific order location. Repeat for additional options. Options include:
-
Attempt in Landscape
-
Attempt in Text
-
Assign Text Cutoff Flag and Manage in QC - This is the default setting. It cannot be repositioned.
-
- Click OK to exit the Advanced General Options dialog box.
Streaming Discovery Imaging: Excel Options
-
Click the Excel tab to set the processing options for Excel files.
-
Process with Outside-In (Stellent) - Selecting this option to:
- Allow for faster and more consistent generation of images on the first pass
- Reduce the amount of time spent manually QCing these document types
When selected, only Outside-In (Stellent) is used to process images; the Microsoft related options are grayed out by default. Full metadata is extracted and time zone imaged output reflects the time-zone handling options configured for the Processing Job. All files processed by Outside-In (Stellent) receive the Stellent Processed flag in QC.
The processing output differs when using Outside-In (Stellent) to view and image documents. However, the QC applied flags, metadata, and optional summary reports are similar if processing was done without Outside-In (Stellent). Other processing options, including Flex Processor processing options, are respected when using Outside-In (Stellent).
- Comments - Set where you want comments displayed. Select from None, At end of sheet, or As displayed on sheet.
-
Color Depth - Set the Color Depth options. Color processing for Excel files is handled separately from color processing of other types of files. This setting is independent of the General Color Depth.
Single Page Output Type
General Color Depth Options
Rendered as
Black&White (1-bit)
Group 4 TIFF
Grayscale (8-bit)
LZW TIFF
256 Color (8-bit)
LZW TIFF
True Color (24-bit)
JPEG
Multi-Page TIFF Output Type
General Color Depth Options
Rendered as
Black&White (1-bit)
Group 4 TIFF
Grayscale (8-bit)
LZW TIFF
256 Color (8-bit)
LZW TIFF
True Color (24-bit)
JTIFF - (JPEG compressed TIFF)
-
Paper Size - Click the drop-down menu and select an output paper size for documents during processing.
Note: For Excel Only - For Custom[8.5x11.0in], the Custom Paper Size dialog box appears.
The Custom Paper size defaults to 8.5x11 inches. The range values are shown for both Units: Inches and Millimeters. Maximum size in Inches 50.00x70.00; for Millimeters 1270.00x1778.00. When this option is selected, the document will be processed through the PDF driver (Text-Based PDF creation) regardless of the Flex Processor option selected. OCRing is not applicable in this instance. Export settings will be limited to Text-Based PDF Output only, even if image format is selected. Non-Excel documents will export as usual.
-
Center on Page - Determines where to center the image on the page.
-
Horizontally
-
Vertically
-
-
Page Order - Determines the page order to be used for imaging.
-
As is
-
Down, and then over
-
Over, and then down
-
-
Orientation - Determines the orientation of the page at the time of printing.
-
As is
-
Portrait
-
Landscape
-
-
Scaling - Specifies whether or not the image should be scaled and how. If scaling is used the options are adjusted to a percentage of the current size, or is modified to fit the page.
-
As is
-
Adjust to % normal size
-
Fit to page
-
-
If you want to set more granular processing options for Excel files, click the Advanced Options button. The Advanced Excel Imaging dialog box appears.
-
At the top of the dialog box, set the options for how to handle headers, footers, and other content in the Excel workbook. Click the Defaults button to revert to the default settings for these options, as shown in the following image:
If you have trouble locating the referenced options in Excel, click here to view information about how to navigate in Excel to the option.
-
Do not include headers - View > Header and Footer: Header/Footer Tab, Header drop-down list, None
-
Do not include footers - View > Header and Footer: Header/Footer Tab, Footer drop-down list, None
-
Reveal hidden columns - Format > Column > Unhide
-
Reveal hidden rows - Format > Row > Unhide
-
Unhide worksheets - Format > Sheet > Unhide
-
Unhide very hidden worksheets - Unhides worksheets that were hidden by a Microsoft Visual Basic for Applications program that assigned the property xlSheetVeryHidden. (From the Microsoft Excel Help File: If sheets are hidden by a Microsoft Visual Basic for Applications program that assigns the property xlSheetVeryHidden, you cannot use the Unhide command to display the sheets. If you are using a workbook with Visual Basic macros and have problems with hidden sheets, contact the owner of the workbook for more information.)
-
Autofit columns - Double click the right boundary of the column heading for that row.
-
Autofit rows- Double click the boundary below that row heading.
- Wrap text - Format > Cells: Alignment Tab, Wrap Text Option.
-
Print gridlines - File > Page Setup: Sheet Tab, Under Print, select Gridlines check box.
-
Unhide windows - Window > Unhide.
-
Apply Autofilter - Data > Filter > AutoFilter
- No fill color (for cells) - Format > Cells: Patterns Tab, Under Color, click No Color.
-
Clear print area - File > Print Area > Clear Print Area.
-
Clear print title columns - File > Page Setup: Sheet Tab, under Print Titles select the columns to repeat range.
-
Clear print title rows - File > Page Setup: Sheet Tab, under Print Titles select the rows to repeat range.
-
Display headings - File > Page Setup: Sheet Tab, under Print, select the Row and column headings check box.
-
Expand Pivot Tables - Right click Pivot Table to display context menu. Choose Expand/Collapse > Expand.
-
-
Set the remaining settings in the Advanced Excel Imaging dialog box.
The following table provides a list of the available options.
Setting
Options
Date field handling:
-
Replace with date created - will replace with creation date.
-
Replace with date last saved - will replace current date with last saved dated.
-
Replace with comments - displays the Date Field Comments field where you can enter the text that should replace the contents of the date field.
-
Replace with field code
-
Do not replace - will not replace the date (e.g., Macros)
Header/Footer Filename field handling
If path or filename options are found in an Excel header or footer, you can select from the following options to handle these occurrences.
-
Replace with filename (no path) - inserts the unqualified filename
-
Replace with filepath - inserts the fully-qualified path of the original file
-
Replace with comments - displays the Header/Footer Filename field comments field where you can enter your own comments
-
Replace with field code - replaces outputs &[Path] and/or &[File]
-
Remove - removes the codes entirely
Generate metadata
Select Generate a metadata summary images for each Excel spreadsheet, and then under Spreadsheet Metadata Summary Options select the individual types of metadata to capture.
-
Document Properties
-
Comments
-
Formulas
-
Linked Content - The data collected will include hyperlinks and OLE linked files. If any linked content exists in a document, a QC flag will be added. A separate page entitled Document Properties is generated and is placed at the end of each Microsoft Excel document.
For more information about metadata, click here.
Who creates the metadata? The native program (such as Microsoft Excel or Outlook) creates the metadata and maintains it with the native file (the letter or email).
What does eCapture do with this data? When a document is processed, the metadata is collected from the document and stored in the database.
How is metadata useful? It gives you valuable information as to “Who knew what, and when.” It can tell you who wrote a document and who edited it last. It also shows you a file’s revision number, the character count, and many other pieces of information about a file summary image for each Excel spreadsheet.
Blank page removal
This option is available if the Remove Blank Pages option is selected under the General Options tab. Select from the following two options to remove blank pages:
-
Based on selected Page Order: Down, then over or Over, then down.
-
If Down, then over is selected, all vertical page columns that are blank will be removed.
-
If Over, then down is selected, all horizontal page rows where all pages in a horizontal run are blank will be removed.
-
-
Based on both Page Order options: This bases the removal of blank pages on both horizontal page-rows and vertical page-columns.
Example of Page Removal
The following example pertains to using a spreadsheet with 12 pages that will be rendered.
-
If the sheet's page order is Over, then down, eCapture removes all horizontal page rows where all pages in a horizontal run are blank. In order to do that, eCapture steps through all HPageBreaks and makes sure the range from the first column to the last column is blank.
-
If eCapture determines that 1-3 is blank, then they will be hidden. If eCapture determines that 4-6 is blank, then they will be hidden, and so on.
-
If the sheet's page order is Down, then over, eCapture will remove all vertical page columns that are blank.
-
If eCapture determines that 1-A is blank, then they will be hidden. If eCapture determines that 2-B is blank, then they will be hidden, and so on.
By using this algorithm, all blank pages will not be eliminated, though many of them will be.
Note: All page-hiding is done by setting horizontal regions' RowHeight properties and vertical regions’ ColumnWidth properties to 0.
-
- Click OK to exit the Advanced Excel Imaging dialog box.
Streaming Discovery Imaging: Word Options
-
Process with Outside-In (Stellent) - Selecting this option:
- Allows for faster and more consistent generation of images on the first pass
- Reduces the amount of time spent manually QCing these document types
When selected, only Outside-In (Stellent) is used to process images; the Microsoft related options are grayed out by default. Full metadata is extracted and time zone imaged output reflects the time zone handling options configured for the Processing Job. All files processed by Outside-In (Stellent) receive the Stellent Processed flag in QC.
The processing output will differ when using Outside-In (Stellent) to view and image documents. However, the QC applied flags, metadata, and optional summary reports will be similar if processing was done without Outside-In (Stellent). Other processing options, including Flex Processor processing options, are respected when using Outside-In (Stellent).
-
Select the option Show Hidden Text to see hidden text, if any, contained in Word documents.
-
Select the appropriate revision option. The option you select determines how the system handles revisions within Word documents.
-
As is - Print the document as it is according to the Office Settings on the computer.
-
Detail Revisions - Print the document with revisions shown.
-
Final Copy (hide revisions) - Print the document with no revisions shown.
-
Both Copies - Documents are printed. If a document has revisions, it's printed again with the revisions shown. Documents with revisions will then have two sets of images, one right after the other.
-
-
Select the appropriate orientation option. The option you select determines how the system orients images of Word documents.
-
As is
-
Portrait
-
Landscape
-
-
Select the Scale to Page option to scale the contents of the page to fit in the printable area. This sets the PrintZoomPageWidth and PrintZoomPageHeight to the paper size of the printer when printing Word documents.
-
Color Depth - Color processing for Word documents is handled separately from color processing of other types of files. This setting is independent of the General Color Depth options located in the Processing Options: General Options tab.
Single Page Output Type
General Color Depth Options
Rendered as
Black&White (1-bit)
Group 4 TIFF
Grayscale (8-bit)
LZW TIFF
256 Color (8-bit)
LZW TIFF
True Color (24-bit)
JPEG
Multi-Page TIFF Output Type
General Color Depth Options
Rendered as
Black&White (1-bit)
Group 4 TIFF
Grayscale (8-bit)
LZW TIFF
256 Color (8-bit)
LZW TIFF
True Color (24-bit)
JTIFF - (JPEG compressed TIFF)
- Select the appropriate Paper Size for Word documents.
- If you want to set more granular options for handling of Word documents, click the Advanced Options button.
In the Field Handling section, select the Date Field Handling options:
Replace with date created - will replace with creation date.
Replace with date last saved - will replace current date with last saved dated.
Replace with comments - displays the Date Field Comments field where you can enter the text that should replace the contents of the date field.
Replace with field code
Do not replace - will not replace the date (e.g. Macros)
Remove - removes the codes entirely.
In the Field Handling section, select the Filename handling options:
Replace with filename (no path)
Replace with filepath
Replace with comments - displays the Filename Comments field where you can enter the text that should replace the filename
Replace with field code
Do not replace
Set the metadata options for Word documents
Select Generate metadata. The native program, in this case Word, creates the metadata and maintains it with the native file. When a document is processed, the metadata is collected from the document and stored in the database. Metadata gives you valuable information as to “Who knew what, and when.” It can tell you who wrote a document and who edited it last. It also shows you a file’s revision number, the character count, and many other pieces of information about a file.
Select the individual types of metadata to capture under Document Metadata Summary Options:
Document Properties
Revisions
Comments
Routing Slips
Linked Content - The data collected will include hyperlinks and OLE linked files. If any linked content exists in a document, a QC flag will be added.
A separate page entitled Document Properties is generated and is placed at the end of each Microsoft Word document. For example, The Document Properties page may contain the following data:
Title, Author, Company, Attached Template, Page count, Paragraph Count, Line Count, Word Count, Character Count (spaces excluded), and Character Count (spaces included).
- When finished setting Advanced Options, click OK to exit the Advanced Word Imaging dialog box.
- When finished setting Word Options, click OK to exit the Options for Processing dialog box or click one of the other tabs to set options for other types of files.
Streaming Discovery Imaging: PowerPoint Options
-
Select Original Settings (As Is) to use Microsoft PowerPoint’s default settings.
-
Select the Page Orientation. The options are: As is, Portrait, and Landscape.
-
Select the Slide Orientation. the options are: As is, Portrait, and Landscape.
-
Select the Color Depth to be used for processing PowerPoint presentations. Color processing for PowerPoint presentations is handled separately from color processing of other types of files. This setting is independent of the General Color Depth options located in the Processing Options: General Options tab.
Single Page Output Type
General Color Depth Options
Rendered as
Black&White (1-bit)
Group 4 TIFF
Grayscale (8-bit)
LZW TIFF
256 Color (8-bit)
LZW TIFF
True Color (24-bit)
JPEG
Multi-Page TIFF Output Type
General Color Depth Options
Rendered as
Black&White (1-bit)
Group 4 TIFF
Grayscale (8-bit)
LZW TIFF
256 Color (8-bit)
LZW TIFF
True Color (24-bit)
JTIFF - (JPEG compressed TIFF)
-
Select the Output Type. The options are: Slides, Outline, Notes Pages (notes and slide on one page), Notes Pages Split (notes and slide on separate page), or Handouts.
-
Select a Slide Size. Choose a slide size or As Is from the drop-down menu.
-
Select an output Paper Size or As Is from the drop-down menu.
-
To select more complex PowerPoint options, click the Advanced Options button.
- Print Hidden Slides - Select this option to print slides that are hidden from the slide show.
- Print Comments - Select this options to print comments for your slides.
- Frame Slides - Selecting this option prints a border around each slide.
- Scale to Fit Page - Select this option to ensure all available text appears on the slide that was imaged from eCapture,
-
Handouts - Select the desired handout options:
-
Slides per Page
-
Order (if generating 4 or more slides per page)
-
- Include Linked Content Summary - Select this option to ensure that the data collected includes hyperlinks and OLE linked files. If any linked content exists in a document, a QC flag is added.
-
Headers and Footers - For Headers and Footers, you can set options for Slides or Notes & Handouts. The tabs that display are based on the Output Type selected on the basic PowerPoint Options tab. The options are: Slides, Outline, Notes Pages (notes and slide on one page), Notes Pages Split (notes and slide on separate page), or Handouts.
Slides: For the Output Type of Slides, select from the following options from the Slide Tab:
- Select Date and Time if you want the page header to list the Date last saved or the Date created at the top of the image.
-
If Date and Time is selected, you can select the Update Automatically option. Select Date last saved or Date created.
-
Format: Select a format option for the date and time.
-
Select Fixed if you want to manually enter a fixed date and time in the image header.
- Select Footer if you want a footer at the bottom of the image.
-
If Footer is selected, enter static text that you want printed at the bottom of the image or check As is to maintain the existing footer for the slide.
- If Footer is selected, select a Slide Number option to define whether a slide number should show on the image. The options are: As is, Show, Do not show.
-
If Footer is selected, select a Show on Title Slide option to define whether to show the footer on the title slide image. The options are: As is, Show, Do not show.
Other than Slides: If, on the basic PowerPoint imaging options tab you set the Output Type to anything other than Slides, select from the following options on the Notes and Handouts tab:
- Select Date and Time if you want the notes/handouts to list the date/time.
-
If Date and Time is selected, select the Update Automatically option: Select Date last saved or Date created.
- Format: Select a format option for the date and time.
-
Select Fixed if you want to manually enter a fixed date and time in the image header.
-
Select Header if you want a header at the top of the image. You can either enter a fixed text to add or check the As Is option to maintain the existing headers.
-
Select Footer if you want a footer at the bottom of the image.
-
If Footer is selected, you can enter static text that you want printed at the bottom of the image.
- If Footer is selected, select a Page Number option to define whether or not a page number should show on the image. The options are : As is, Show, Do not show.
- Click OK to exit the Advanced PowerPoint Options dialog box.
- Click OK to exit the Options for Processing dialog box, or click one of the other tabs to set options for other types of files.
Streaming Discovery Imaging: Placeholder Options
- Click the Placeholder tab.
-
Click the button to create a new placeholder. The Create New Placeholder dialog box appears.
-
Enter a Placeholder Name. When you are finished creating your placeholder, the Placeholder Name will display in the Placeholder grid located in the Placeholder tab.
-
Select the check boxes next to the File Types/Extensions for which you want to have placeholders when a Streaming Discovery Imaging Job runs. By default, all File Types/Extensions are unselected.
- Click Select All to select all file types.
- Click Clear All to clear the selections and individually select the desired file types.
-
Expand a file type to view its subcategories. Filtering may be done on specific subcategories of a file type.
eCapture recognizes documents by their actual content and not the file extension. Keep this in mind as you exclude/include file types. You can filter (exclude) a myriad of file types by simply ensuring that the File Types/Extensions are unselected. When the Job runs, it will create placeholders for only those file types that are selected. These file types are based on the Oracle’s Outside-In identification criteria.
- If you want to add more file extensions to the placeholder definition you are creating, in the Placeholder these File Extensions list box, click to add the extension to the list. At least one file type or category must be selected. Repeat this step for each extension. File extensions are automatically alphabetized.
- If you want to remove a file extension, in the Placeholder these File Extensions list box, select the extension and click .
- If you want to clear all the extensions from the list, in the Placeholder these File Extensions list box, click .
-
To import a list of file extensions from a CSV file, in the Placeholder these File Extensions box.
- Click .
- Select the CSV file.
- Click Open. An Import From File progress bar appears. If any errors were encountered during the import, such as duplicates, an Information dialog box displays with the errors. The CSV file may contain extensions with or without a "." (period). Ensure the CSV file contains only one column of file extensions with each extension occupying its own row, for example, Range A1 through A50 or Range E1 through E50. The file extensions are alphabetized as they are imported.
- Set the File Size parameters. The default setting is None. If specified, file sizes may be Over or Under a specified amount. The selected file size applies to the files in the Imaging Job that have sizes on disk that are either greater than or equal to, or less than or equal to, the size specified. The size is expressed in KB. For example, a 1 MB file is entered as 1024 KB.
- Select the Extract Text of Document check box to extract the document text. By default, this check box is cleared.
- Select the Apply Max Page Threshold check box and indicate a threshold value (1 to 10000) to limit the number of pages produced by larger files. By default, this check box is cleared. If the page threshold is reached, the items are flagged as Page Threshold Exceeded. All pages imaged up until the threshold is reached are included in the document. The first page is the Page Threshold Exceeded placeholder, and subsequent pages will be those that were processed within the Max Page Threshold setting.
-
To use a predefined placeholder:
- Click to display the Open dialog box.
- Select a placeholder image. File type options include JPG and TIF.
-
Click Open. The selected image displays in the view box underneath the option.
-
If you want to use a custom placeholder, click .
The Custom Placeholder Configuration dialog box appears.
Complete the necessary fields in the Custom Placeholder Configuration dialog box.
-
Click the drop-down menu located above the Available Fields list and select a specific field type. By default, All Fields is displayed.
-
To narrow the field list:
- To display all fields, delete the value (in this example, the word date), leaving the field empty, and click .
- Click to move a selected field from the Available Fields list to the Selected Fields list.
- Click to move a selected field from the Selected Fields list to the Available Fields list.
- Click to open the Insert Custom Field dialog box in which you can create new group fields and new user fields.
- Use the and arrows to change the order of the fields in the Selected Fields list. Select a field (or contiguous fields) and then use either arrow to reposition the selected field(s).
-
Select a field in the Selected Fields list. The selected field appears in the Font section. Click to open the Font dialog box.
- Select the desired Font, Font Style, and Size; then click OK to return to the Custom Placeholder Configuration dialog box. Repeat this step for each additional field.
-
In the Field Options section, if necessary, select Include labels with values. When selected, both the field label and its value are included.
-
Click . The Date Field Formatting Options dialog box appears.
-
Select the Date Field Formatting and Time Format for the custom placeholder.
-
If you want to change the date field to a different format, click the drop-down menu arrow and select from the following date formats:
-
YYYYMMDD
-
YYYY/MM/DD
-
MMDDYYYY
-
MM/DD/YYYY
-
DD/MM/YYYY
- Otherwise, select the option, Do Not Convert Date Fields.
-
-
If you want to change the Time Format, click the drop-down menu and select from the following options:
-
12-hour [displays time in 12-hour format e.g., 1:04]
-
24-hour [displays time in 24-hour format, e.g., 13:04]
-
Regional [formats the time according to the “default” regional settings of the Worker on which the document is being exported.
Note: Changing the format strings by using the Customize button of Regional Settings will have no effect; the actual region must be changed to see any effect.
-
-
Select Resolve times to second precision if you want to add seconds to all metadata date fields that have time. This does not apply to the images.
- By default, the Legacy Date Field Formatting check box is cleared. Clear this option to select from the Invalid date options and to select fields for date format handling.
-
-
If you cleared the Legacy Date Field Formatting check box, set the Invalid date options:
- Treat date values outside of specified range as invalid dates - Select this check box and then select a Start Date and End Date range. Any dates outside of the selected range are considered as invalid dates. The start date default is set to SQL minimum date. The end date default is set to SQL maximum date.
-
Choose one of the following options:
-
Invalid date field output value - enter text to display if an invalid date is encountered. This field may be left blank.
-
Invalid date field output do not convert - invalid dates will be output as a text field.
-
-
From the Available Fields list, select the fields you want to use for date formatting and move them to the Fields Selected for Date Format Handling list. There are a few considerations about date fields to keep in mind:
-
The only fields that are not present in the list are *DATE_ONLY* and *TIME_ONLY*. The fields in the Available Fields list comprise those that are marked as valid for date formatting. This is determined by the value of TRUE in the ExportAttemptDateParse field located in the EncounteredMetatdataFieldList table.
-
Date field formatting options affect only those fields in the Fields Selected for Date Format Handling list.
-
Date field formatting options are set at the Job level.
To select fields for date format handling.
-
Select a field for date format handling by selecting the field from the Fields Available for Date Format Handling list and clicking to move the single field to the Fields Selected for Date Format Handling list.
-
For two or more fields, Ctrl-click to select non-contiguous fields or Shift-click to select contiguous fields. After selecting the fields, click to move them to the Fields Selected for Date Format Handling list.
eCapture creates two additional fields that “split” the date and time into a Date Only field and a Time Only field. These two additional fields are displayed in the Available Fields list in the Export Wizard, Select Export Fields screen. For example, if the DueDate field was moved to the Fields Selected for Date Format Handling list, the following additional DueDate fields would display in the Available Fields list: DueDate*DATE ONLY* and DueDate*TIME ONLY*.
-
- When you are finished setting the Date Field Formatting options, click OK. The Custom Placeholder dialog box appears.
-
In the Placement Options dialog box, select the placement settings for the placeholder.
-
Set the alignment positioning for the placeholder.
-
Vertical Alignment: Determines placement along the vertical axis. Options include Top, Center, or Bottom. Top is the default.
-
Horizontal Alignment: Determines placement along the horizontal axis. Options include Left, Center, or Right. Left is the default.
-
- Set the Indentation (Left and Right) for the placeholder. This setting determines the horizontal spacing to the left or right of the page margins.
- Set the Truncation for the placeholder. Truncation determines the number of characters at which the field value will be truncated. The default value is 128 characters.
-
-
If you want to save your Custom Placeholder formatting to a file, to be used later, click Save, enter a Description for the placeholder, and then click OK. You will also be prompted to save the custom placeholder definition before you exit the Custom Placeholder Configuration dialog box.
-
When you are finished creating the custom placeholder, click OK. The Save Changes dialog box appears.
- Click Yes to save the custom placeholder definition.
- Enter a Description for the placeholder.
-
Click OK.
-
Use the zoom in or zoom out buttons to view the image before finalizing.
- To remove the selected image from the view box, click . The existing image must be removed before selecting a new image.
-
To exit the Custom Placeholder Configuration dialog box, click OK. The placeholder displays in the Placeholder grid.
Note: More than one placeholder may be created for the imaging job. When two or more placeholders exist for a Streaming Imaging job, rule functionality, similar to the Flex Processor, is used. Each placeholder’s document criteria selection is applied in placeholder order with the last placeholder rule (applied to the document) determining the processing output. The Placeholder rule order may be changed before starting the job.
-
The Description field will contain the following values based on the selected Placeholder criteria:
- File Types
- File Types, Extensions
- File Types, Extensions, File Size
-
File Types, File Size
To edit the Placeholder criteria, double-click the Description field of the desired Placeholder. The Edit Placeholder dialog box appears. Make the changes and click to return to the Placeholder grid.
- To delete a Placeholder from the grid, click . A prompt displays to confirm the deletion. Click .
- To change the order of the placeholders in the grid, select a placeholder and click or to move the selected placeholder into the correct position. Repeat this step for all placeholders until they are in the desired order.
QC Flags for Streaming Jobs
The QC Flags Window is located on the right side of the QC Interface (default view) and shows the flags and their corresponding user/system assigned number. The following QC flags are applicable to Streaming Discovery.
- Streaming Discovery Failure: This flag is set if eCapture data extracts a document, and the job is a Streaming Discovery Job that failed. It falls back to eCapture for processing.
- Streaming Discovery Errors Forced Through Export: Identifies parent documents with container-level errors that were forced through processing and Export (via Publish Errors). The QC flag applies to Streaming Discovery Jobs only.
-
Password Applied: Applied to files unlocked successfully with user-defined passwords. Supported file types include Microsoft Word, Excel, PowerPoint, and Adobe PDFs. Unlike the restriction-type QC flags that indicate a native document is password protected, the Password Applied QC flag is wholly independent because it describes a processing activity. Specifically, it indicates whether or not the system utilized a user-configured password to process the native file. This QC flag is not a property of a native document.
Note: The Exception QC flag is not applied for documents flagged with the Password Applied QC flag.
- Protected_Document: This flag is set when encountering encrypted native documents with protected sections, or password protected files for Microsoft Office and Adobe PDF documents. The file may be processed if configured with the requisite password.
- Protected_Content: Indicates that access to portions of the native document is restricted via password protection.
-
Protected_Functionality: Indicates that access to application features or functions is restricted via password protection. Document set to Read Only.
|
Note: For Enterprise Streaming jobs, the metadata field Flag Status was added to the Metadata tab in QC. This metadata field appears based on whether a follow-up flag was clicked for an Outlook message. In Outlook, the default setting is unchecked. When the flag is checked, it changes to red. When the red flag is clicked, it changes to a gray checkmark. The Field Values for the Flag Status field are: 1, 2, and 3 depending on the number of times the follow up flag/checkmark was clicked for the Outlook message. The Flag Status metadata field will not appear in the metadata list in QC if the follow-up flag was not clicked for a message. |
Related Topics
Overview: Create Sessions and QC Items
QC Individual Items in the Documents Window
Change Processing Job Options for a Processing Job in QC
Change Data Extract Job Options for a Data Extract Job in QC